Overview

Dataset statistics

Number of variables90
Number of observations59381
Missing cells226163
Missing cells (%)4.2%
Duplicate rows9769
Duplicate rows (%)16.5%
Total size in memory40.8 MiB
Average record size in memory720.0 B

Variable types

BOOL48
CAT35
NUM7

Reproduction

Analysis started2020-06-11 14:07:38.921731
Analysis finished2020-06-11 14:09:25.824722
Duration1 minute and 46.9 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

Dataset has 9769 (16.5%) duplicate rows Duplicates
Medical_History_26 is highly correlated with Medical_History_25 and 1 other fieldsHigh correlation
Medical_History_25 is highly correlated with Medical_History_26 and 1 other fieldsHigh correlation
Medical_History_36 is highly correlated with Medical_History_25 and 1 other fieldsHigh correlation
Medical_Keyword_11 is highly correlated with Medical_History_37High correlation
Medical_History_37 is highly correlated with Medical_Keyword_11High correlation
Medical_Keyword_23 is highly correlated with Medical_History_33High correlation
Medical_History_33 is highly correlated with Medical_Keyword_23High correlation
Medical_Keyword_48 is highly correlated with Medical_History_6High correlation
Medical_History_6 is highly correlated with Medical_Keyword_48High correlation
Medical_History_1 has 8889 (15.0%) missing values Missing
Medical_History_10 has 58824 (99.1%) missing values Missing
Medical_History_15 has 44596 (75.1%) missing values Missing
Medical_History_24 has 55580 (93.6%) missing values Missing
Medical_History_32 has 58274 (98.1%) missing values Missing
Medical_History_1 has 4789 (8.1%) zeros Zeros
Medical_History_15 has 2135 (3.6%) zeros Zeros
Medical_History_24 has 769 (1.3%) zeros Zeros
Medical_History_32 has 744 (1.3%) zeros Zeros

Variables

Medical_History_1
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count171
Unique (%)0.3%
Missing8889
Missing (%)15.0%
Infinite0
Infinite (%)0.0%
Mean7.962172225303019
Minimum0.0
Maximum240.0
Zeros4789
Zeros (%)8.1%
Memory size463.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median4
Q39
95-th percentile27
Maximum240
Range240
Interquartile range (IQR)7

Descriptive statistics

Standard deviation13.02769726
Coefficient of variation (CV)1.636198877
Kurtosis51.13486082
Mean7.962172225
Median Absolute Deviation (MAD)3
Skewness5.63523878
Sum402026
Variance169.7208958
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1765712.9%
 
257279.6%
 
047898.1%
 
347228.0%
 
436686.2%
 
531495.3%
 
630175.1%
 
720823.5%
 
1217382.9%
 
816182.7%
 
Other values (161)1232520.8%
 
(Missing)888915.0%
 
ValueCountFrequency (%) 
047898.1%
 
1765712.9%
 
257279.6%
 
347228.0%
 
436686.2%
 
ValueCountFrequency (%) 
2403< 0.1%
 
2391< 0.1%
 
2291< 0.1%
 
2281< 0.1%
 
2231< 0.1%
 

Medical_History_2
Real number (ℝ≥0)

Distinct count579
Unique (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean253.98710025092203
Minimum1
Maximum648
Zeros0
Zeros (%)0.0%
Memory size463.9 KiB

Quantile statistics

Minimum1
5-th percentile16
Q1112
median162
Q3418
95-th percentile579
Maximum648
Range647
Interquartile range (IQR)306

Descriptive statistics

Standard deviation178.6211541
Coefficient of variation (CV)0.7032686065
Kurtosis-0.931750075
Mean253.9871003
Median Absolute Deviation (MAD)99
Skewness0.5939859107
Sum15082008
Variance31905.51667
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1121102718.6%
 
162834314.0%
 
49157979.8%
 
33530325.1%
 
16129425.0%
 
1629204.9%
 
26123033.9%
 
47819573.3%
 
314422.4%
 
62811862.0%
 
Other values (569)1843231.0%
 
ValueCountFrequency (%) 
124< 0.1%
 
29< 0.1%
 
314422.4%
 
55< 0.1%
 
61< 0.1%
 
ValueCountFrequency (%) 
6481< 0.1%
 
6471< 0.1%
 
6461< 0.1%
 
6451< 0.1%
 
6441< 0.1%
 
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
53306
3
 
6071
1
 
4
ValueCountFrequency (%) 
25330689.8%
 
3607110.2%
 
14< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
38887
1
20494
ValueCountFrequency (%) 
23888765.5%
 
12049434.5%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
1
58946
2
 
433
3
 
2
ValueCountFrequency (%) 
15894699.3%
 
24330.7%
 
32< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1

Medical_History_6
Categorical

HIGH CORRELATION

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
56111
1
 
3268
2
 
2
ValueCountFrequency (%) 
35611194.5%
 
132685.5%
 
22< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
57608
3
 
1251
1
 
522
ValueCountFrequency (%) 
25760897.0%
 
312512.1%
 
15220.9%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
54225
3
 
3887
1
 
1269
ValueCountFrequency (%) 
25422591.3%
 
338876.5%
 
112692.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
45712
1
13665
3
 
4
ValueCountFrequency (%) 
24571277.0%
 
11366523.0%
 
34< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1

Medical_History_10
Real number (ℝ≥0)

MISSING

Distinct count103
Unique (%)18.5%
Missing58824
Missing (%)99.1%
Infinite0
Infinite (%)0.0%
Mean141.1184919210054
Minimum0.0
Maximum240.0
Zeros75
Zeros (%)0.1%
Memory size463.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q18
median229
Q3240
95-th percentile240
Maximum240
Range240
Interquartile range (IQR)232

Descriptive statistics

Standard deviation107.7595593
Coefficient of variation (CV)0.7636104798
Kurtosis-1.771628535
Mean141.1184919
Median Absolute Deviation (MAD)11
Skewness-0.3099682123
Sum78603
Variance11612.12263
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2402700.5%
 
0750.1%
 
122< 0.1%
 
213< 0.1%
 
510< 0.1%
 
86< 0.1%
 
45< 0.1%
 
35< 0.1%
 
1204< 0.1%
 
1114< 0.1%
 
Other values (93)1430.2%
 
(Missing)5882499.1%
 
ValueCountFrequency (%) 
0750.1%
 
122< 0.1%
 
213< 0.1%
 
35< 0.1%
 
45< 0.1%
 
ValueCountFrequency (%) 
2402700.5%
 
2381< 0.1%
 
2371< 0.1%
 
2362< 0.1%
 
2351< 0.1%
 
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
59103
2
 
190
1
 
88
ValueCountFrequency (%) 
35910399.5%
 
21900.3%
 
1880.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
56018
3
 
3362
1
 
1
ValueCountFrequency (%) 
25601894.3%
 
333625.7%
 
11< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
52496
1
 
6883
2
 
2
ValueCountFrequency (%) 
35249688.4%
 
1688311.6%
 
22< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
57769
2
 
1356
1
 
256
ValueCountFrequency (%) 
35776997.3%
 
213562.3%
 
12560.4%
 

Length

Max length1
Median length1
Mean length1
Min length1

Medical_History_15
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count241
Unique (%)1.6%
Missing44596
Missing (%)75.1%
Infinite0
Infinite (%)0.0%
Mean123.76097396009469
Minimum0.0
Maximum240.0
Zeros2135
Zeros (%)3.6%
Memory size463.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q117
median117
Q3240
95-th percentile240
Maximum240
Range240
Interquartile range (IQR)223

Descriptive statistics

Standard deviation98.51620609
Coefficient of variation (CV)0.7960199644
Kurtosis-1.685713871
Mean123.760974
Median Absolute Deviation (MAD)114
Skewness0.01707845706
Sum1829806
Variance9705.442863
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
24045667.7%
 
021353.6%
 
12370.4%
 
21320.2%
 
121320.2%
 
141220.2%
 
131100.2%
 
151100.2%
 
31090.2%
 
16870.1%
 
Other values (231)704511.9%
 
(Missing)4459675.1%
 
ValueCountFrequency (%) 
021353.6%
 
12370.4%
 
21320.2%
 
31090.2%
 
4740.1%
 
ValueCountFrequency (%) 
24045667.7%
 
23925< 0.1%
 
238400.1%
 
237340.1%
 
236320.1%
 
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
1
49656
3
 
9724
2
 
1
ValueCountFrequency (%) 
14965683.6%
 
3972416.4%
 
21< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
58076
2
 
1304
1
 
1
ValueCountFrequency (%) 
35807697.8%
 
213042.2%
 
11< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
1
56212
2
 
3159
3
 
10
ValueCountFrequency (%) 
15621294.7%
 
231595.3%
 
310< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
1
57340
2
 
2036
3
 
5
ValueCountFrequency (%) 
15734096.6%
 
220363.4%
 
35< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
58493
1
 
887
3
 
1
ValueCountFrequency (%) 
25849398.5%
 
18871.5%
 
31< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
1
52913
2
 
6464
3
 
4
ValueCountFrequency (%) 
15291389.1%
 
2646410.9%
 
34< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
58291
1
 
1090
ValueCountFrequency (%) 
25829198.2%
 
110901.8%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
45370
1
14010
2
 
1
ValueCountFrequency (%) 
34537076.4%
 
11401023.6%
 
21< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1

Medical_History_24
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count227
Unique (%)6.0%
Missing55580
Missing (%)93.6%
Infinite0
Infinite (%)0.0%
Mean50.63562220468298
Minimum0.0
Maximum240.0
Zeros769
Zeros (%)1.3%
Memory size463.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median8
Q364
95-th percentile240
Maximum240
Range240
Interquartile range (IQR)63

Descriptive statistics

Standard deviation78.14906865
Coefficient of variation (CV)1.543361476
Kurtosis0.9684008811
Mean50.6356222
Median Absolute Deviation (MAD)8
Skewness1.557421322
Sum192466
Variance6107.276931
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
07691.3%
 
14080.7%
 
2403380.6%
 
21930.3%
 
31630.3%
 
4940.2%
 
6930.2%
 
5880.1%
 
12850.1%
 
8680.1%
 
Other values (217)15022.5%
 
(Missing)5558093.6%
 
ValueCountFrequency (%) 
07691.3%
 
14080.7%
 
21930.3%
 
31630.3%
 
4940.2%
 
ValueCountFrequency (%) 
2403380.6%
 
2393< 0.1%
 
2384< 0.1%
 
2372< 0.1%
 
2364< 0.1%
 

Medical_History_25
Categorical

HIGH CORRELATION

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
1
48040
2
11105
3
 
236
ValueCountFrequency (%) 
14804080.9%
 
21110518.7%
 
32360.4%
 

Length

Max length1
Median length1
Mean length1
Min length1

Medical_History_26
Categorical

HIGH CORRELATION

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
48041
2
11337
1
 
3
ValueCountFrequency (%) 
34804180.9%
 
21133719.1%
 
13< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
58790
1
 
584
2
 
7
ValueCountFrequency (%) 
35879099.0%
 
15841.0%
 
27< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
1
55393
2
 
3985
3
 
3
ValueCountFrequency (%) 
15539393.3%
 
239856.7%
 
33< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
45802
1
13576
2
 
3
ValueCountFrequency (%) 
34580277.1%
 
11357622.9%
 
23< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
56952
3
 
2425
1
 
4
ValueCountFrequency (%) 
25695295.9%
 
324254.1%
 
14< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
58943
1
 
437
2
 
1
ValueCountFrequency (%) 
35894399.3%
 
14370.7%
 
21< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1

Medical_History_32
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count95
Unique (%)8.6%
Missing58274
Missing (%)98.1%
Infinite0
Infinite (%)0.0%
Mean11.965672990063235
Minimum0.0
Maximum240.0
Zeros744
Zeros (%)1.3%
Memory size463.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q32
95-th percentile85.8
Maximum240
Range240
Interquartile range (IQR)2

Descriptive statistics

Standard deviation38.71877434
Coefficient of variation (CV)3.235820866
Kurtosis19.74060953
Mean11.96567299
Median Absolute Deviation (MAD)0
Skewness4.342796359
Sum13246
Variance1499.143486
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
07441.3%
 
1730.1%
 
2350.1%
 
323< 0.1%
 
24015< 0.1%
 
515< 0.1%
 
611< 0.1%
 
1311< 0.1%
 
411< 0.1%
 
109< 0.1%
 
Other values (85)1600.3%
 
(Missing)5827498.1%
 
ValueCountFrequency (%) 
07441.3%
 
1730.1%
 
2350.1%
 
323< 0.1%
 
411< 0.1%
 
ValueCountFrequency (%) 
24015< 0.1%
 
2271< 0.1%
 
2191< 0.1%
 
2181< 0.1%
 
1841< 0.1%
 

Medical_History_33
Categorical

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
53580
1
 
5801
ValueCountFrequency (%) 
35358090.2%
 
158019.8%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
50148
1
 
9230
2
 
3
ValueCountFrequency (%) 
35014884.5%
 
1923015.5%
 
23< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
1
59319
3
 
60
2
 
2
ValueCountFrequency (%) 
15931999.9%
 
3600.1%
 
22< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1

Medical_History_36
Categorical

HIGH CORRELATION

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
47358
3
11340
1
 
683
ValueCountFrequency (%) 
24735879.8%
 
31134019.1%
 
16831.2%
 

Length

Max length1
Median length1
Mean length1
Min length1

Medical_History_37
Categorical

HIGH CORRELATION

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
2
55719
1
 
3660
3
 
2
ValueCountFrequency (%) 
25571993.8%
 
136606.2%
 
32< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
1
59093
2
 
288
ValueCountFrequency (%) 
15909399.5%
 
22880.5%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
54354
1
 
5025
2
 
2
ValueCountFrequency (%) 
35435491.5%
 
150258.5%
 
22< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
3
58418
1
 
961
2
 
2
ValueCountFrequency (%) 
35841898.4%
 
19611.6%
 
22< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
1
40347
3
19033
2
 
1
ValueCountFrequency (%) 
14034767.9%
 
31903332.1%
 
21< 0.1%
 

Length

Max length1
Median length1
Mean length1
Min length1
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
56887
1
 
2494
ValueCountFrequency (%) 
05688795.8%
 
124944.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58850
1
 
531
ValueCountFrequency (%) 
05885099.1%
 
15310.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
56455
1
 
2926
ValueCountFrequency (%) 
05645595.1%
 
129264.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58517
1
 
864
ValueCountFrequency (%) 
05851798.5%
 
18641.5%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58869
1
 
512
ValueCountFrequency (%) 
05886999.1%
 
15120.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58633
1
 
748
ValueCountFrequency (%) 
05863398.7%
 
17481.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58555
1
 
826
ValueCountFrequency (%) 
05855598.6%
 
18261.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58763
1
 
618
ValueCountFrequency (%) 
05876399.0%
 
16181.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58986
1
 
395
ValueCountFrequency (%) 
05898699.3%
 
13950.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
57216
1
 
2165
ValueCountFrequency (%) 
05721696.4%
 
121653.6%
 

Medical_Keyword_11
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
55936
1
 
3445
ValueCountFrequency (%) 
05593694.2%
 
134455.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58787
1
 
594
ValueCountFrequency (%) 
05878799.0%
 
15941.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
59027
1
 
354
ValueCountFrequency (%) 
05902799.4%
 
13540.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58915
1
 
466
ValueCountFrequency (%) 
05891599.2%
 
14660.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
48071
1
11310
ValueCountFrequency (%) 
04807181.0%
 
11131019.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58626
1
 
755
ValueCountFrequency (%) 
05862698.7%
 
17551.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58837
1
 
544
ValueCountFrequency (%) 
05883799.1%
 
15440.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58936
1
 
445
ValueCountFrequency (%) 
05893699.3%
 
14450.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58829
1
 
552
ValueCountFrequency (%) 
05882999.1%
 
15520.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58898
1
 
483
ValueCountFrequency (%) 
05889899.2%
 
14830.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58514
1
 
867
ValueCountFrequency (%) 
05851498.5%
 
18671.5%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
57174
1
 
2207
ValueCountFrequency (%) 
05717496.3%
 
122073.7%
 

Medical_Keyword_23
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
53575
1
 
5806
ValueCountFrequency (%) 
05357590.2%
 
158069.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58259
1
 
1122
ValueCountFrequency (%) 
05825998.1%
 
111221.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
54069
1
 
5312
ValueCountFrequency (%) 
05406991.1%
 
153128.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58583
1
 
798
ValueCountFrequency (%) 
05858398.7%
 
17981.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58677
1
 
704
ValueCountFrequency (%) 
05867798.8%
 
17041.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58494
1
 
887
ValueCountFrequency (%) 
05849498.5%
 
18871.5%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58683
1
 
698
ValueCountFrequency (%) 
05868398.8%
 
16981.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
57894
1
 
1487
ValueCountFrequency (%) 
05789497.5%
 
114872.5%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58734
1
 
647
ValueCountFrequency (%) 
05873498.9%
 
16471.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58124
1
 
1257
ValueCountFrequency (%) 
05812497.9%
 
112572.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58025
1
 
1356
ValueCountFrequency (%) 
05802597.7%
 
113562.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58155
1
 
1226
ValueCountFrequency (%) 
05815597.9%
 
112262.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58969
1
 
412
ValueCountFrequency (%) 
05896999.3%
 
14120.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58763
1
 
618
ValueCountFrequency (%) 
05876399.0%
 
16181.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
55427
1
 
3954
ValueCountFrequency (%) 
05542793.3%
 
139546.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58975
1
 
406
ValueCountFrequency (%) 
05897599.3%
 
14060.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58570
1
 
811
ValueCountFrequency (%) 
05857098.6%
 
18111.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
55999
1
 
3382
ValueCountFrequency (%) 
05599994.3%
 
133825.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58784
1
 
597
ValueCountFrequency (%) 
05878499.0%
 
15971.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
56677
1
 
2704
ValueCountFrequency (%) 
05667795.4%
 
127044.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58745
1
 
636
ValueCountFrequency (%) 
05874598.9%
 
16361.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58934
1
 
447
ValueCountFrequency (%) 
05893499.2%
 
14470.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58568
1
 
813
ValueCountFrequency (%) 
05856898.6%
 
18131.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58877
1
 
504
ValueCountFrequency (%) 
05887799.2%
 
15040.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
58199
1
 
1182
ValueCountFrequency (%) 
05819998.0%
 
111822.0%
 

Medical_Keyword_48
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size463.9 KiB
0
56145
1
 
3236
ValueCountFrequency (%) 
05614594.6%
 
132365.4%
 

Response
Real number (ℝ≥0)

Distinct count8
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.636836698607299
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Memory size463.9 KiB

Quantile statistics

Minimum1
5-th percentile1
Q14
median6
Q38
95-th percentile8
Maximum8
Range7
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.45683306
Coefficient of variation (CV)0.4358531552
Kurtosis-0.8222274182
Mean5.636836699
Median Absolute Deviation (MAD)2
Skewness-0.7746691513
Sum334721
Variance6.036028687
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
81948932.8%
 
61123318.9%
 
7802713.5%
 
2655211.0%
 
1620710.5%
 
554329.1%
 
414282.4%
 
310131.7%
 
ValueCountFrequency (%) 
1620710.5%
 
2655211.0%
 
310131.7%
 
414282.4%
 
554329.1%
 
ValueCountFrequency (%) 
81948932.8%
 
7802713.5%
 
61123318.9%
 
554329.1%
 
414282.4%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

Medical_History_1Medical_History_2Medical_History_3Medical_History_4Medical_History_5Medical_History_6Medical_History_7Medical_History_8Medical_History_9Medical_History_10Medical_History_11Medical_History_12Medical_History_13Medical_History_14Medical_History_15Medical_History_16Medical_History_17Medical_History_18Medical_History_19Medical_History_20Medical_History_21Medical_History_22Medical_History_23Medical_History_24Medical_History_25Medical_History_26Medical_History_27Medical_History_28Medical_History_29Medical_History_30Medical_History_31Medical_History_32Medical_History_33Medical_History_34Medical_History_35Medical_History_36Medical_History_37Medical_History_38Medical_History_39Medical_History_40Medical_History_41Medical_Keyword_1Medical_Keyword_2Medical_Keyword_3Medical_Keyword_4Medical_Keyword_5Medical_Keyword_6Medical_Keyword_7Medical_Keyword_8Medical_Keyword_9Medical_Keyword_10Medical_Keyword_11Medical_Keyword_12Medical_Keyword_13Medical_Keyword_14Medical_Keyword_15Medical_Keyword_16Medical_Keyword_17Medical_Keyword_18Medical_Keyword_19Medical_Keyword_20Medical_Keyword_21Medical_Keyword_22Medical_Keyword_23Medical_Keyword_24Medical_Keyword_25Medical_Keyword_26Medical_Keyword_27Medical_Keyword_28Medical_Keyword_29Medical_Keyword_30Medical_Keyword_31Medical_Keyword_32Medical_Keyword_33Medical_Keyword_34Medical_Keyword_35Medical_Keyword_36Medical_Keyword_37Medical_Keyword_38Medical_Keyword_39Medical_Keyword_40Medical_Keyword_41Medical_Keyword_42Medical_Keyword_43Medical_Keyword_44Medical_Keyword_45Medical_Keyword_46Medical_Keyword_47Medical_Keyword_48Response
04.01122113221NaN3233240.033112123NaN1331323NaN1312213330000000000000000000000000000000000000000000000008
15.04122113221NaN32330.013112123NaN1331323NaN3112213310000000000000000000000000000000000000000000000004
210.032213222NaN3233NaN13112123NaN2231323NaN3313213310000000000000000000000000000000000000000000000008
30.03502213222NaN3233NaN13112223NaN1331323NaN3312213310000000000000000000000000000000100000000000000008
4NaN1622213222NaN3233NaN13112123NaN2231323NaN3313213310000000000000000000000000000000000000000000000008
56.04912213222NaN3233NaN13212223NaN1331323NaN3112213330000000000000000000001000000000001000000000000008
65.06003213221NaN3233NaN13112123NaN1331123NaN3312213330000000000000000000000000000000000000000000000008
76.01452213221NaN3233NaN13112123NaN1331323NaN3312213310000000000000000000000000000000000000000000000001
84.0162213221NaN3233NaN13112123NaN1331123NaN3312213330000000000000000000100000000000000000000000000008
9NaN1622213222NaN3233NaN33112121NaN1331323NaN3312213310000000000000010000000001000000000000000000000001

Last rows

Medical_History_1Medical_History_2Medical_History_3Medical_History_4Medical_History_5Medical_History_6Medical_History_7Medical_History_8Medical_History_9Medical_History_10Medical_History_11Medical_History_12Medical_History_13Medical_History_14Medical_History_15Medical_History_16Medical_History_17Medical_History_18Medical_History_19Medical_History_20Medical_History_21Medical_History_22Medical_History_23Medical_History_24Medical_History_25Medical_History_26Medical_History_27Medical_History_28Medical_History_29Medical_History_30Medical_History_31Medical_History_32Medical_History_33Medical_History_34Medical_History_35Medical_History_36Medical_History_37Medical_History_38Medical_History_39Medical_History_40Medical_History_41Medical_Keyword_1Medical_Keyword_2Medical_Keyword_3Medical_Keyword_4Medical_Keyword_5Medical_Keyword_6Medical_Keyword_7Medical_Keyword_8Medical_Keyword_9Medical_Keyword_10Medical_Keyword_11Medical_Keyword_12Medical_Keyword_13Medical_Keyword_14Medical_Keyword_15Medical_Keyword_16Medical_Keyword_17Medical_Keyword_18Medical_Keyword_19Medical_Keyword_20Medical_Keyword_21Medical_Keyword_22Medical_Keyword_23Medical_Keyword_24Medical_Keyword_25Medical_Keyword_26Medical_Keyword_27Medical_Keyword_28Medical_Keyword_29Medical_Keyword_30Medical_Keyword_31Medical_Keyword_32Medical_Keyword_33Medical_Keyword_34Medical_Keyword_35Medical_Keyword_36Medical_Keyword_37Medical_Keyword_38Medical_Keyword_39Medical_Keyword_40Medical_Keyword_41Medical_Keyword_42Medical_Keyword_43Medical_Keyword_44Medical_Keyword_45Medical_Keyword_46Medical_Keyword_47Medical_Keyword_48Response
593711.01122213222NaN3233NaN13112223NaN2231123NaN3313213330000000000000000000001000000000000000000000000006
593724.01122111222NaN3213NaN13112121NaN1331123NaN3312213330000000001000010000000000000000000000000010000012
593731.02612113222NaN3233182.013112123NaN1331123NaN3312213330000000000000000000000000000000000000000000000008
5937468.04912213222NaN3233NaN13112113NaN2231323NaN3313213310000000000000000000000000000000000000000000000007
59375NaN1622213222NaN3233NaN13112123NaN2231323NaN3113213310000000000000000000000000000000000001000000000008
593760.02612113222NaN323332.013112123NaN1331323NaN3312213330000000000000000000000000000000000000000000000004
5937724.04912213222NaN3233NaN13112123NaN2231323NaN3313213310000000000000000000000000000000000000000000000007
59378NaN1622213222NaN3233NaN13112123NaN2231323NaN3113213310000000000000000000000000000000000001000000000008
593790.0162113222NaN3213240.013112123NaN1331323NaN1312213330000000000000000000000100000000000000001000000008
59380NaN1623113222NaN3233NaN131121238.01331323NaN3312213310000000000000000000000000000000000000000000000007